Picture for Dietrich Klakow

Dietrich Klakow

Split Personality Training: Revealing Latent Knowledge Through Alternate Personalities

Add code
Feb 05, 2026
Viaarxiv icon

AmharicStoryQA: A Multicultural Story Question Answering Benchmark in Amharic

Add code
Feb 02, 2026
Viaarxiv icon

PricingLogic: Evaluating LLMs Reasoning on Complex Tourism Pricing Tasks

Add code
Oct 14, 2025
Figure 1 for PricingLogic: Evaluating LLMs Reasoning on Complex Tourism Pricing Tasks
Figure 2 for PricingLogic: Evaluating LLMs Reasoning on Complex Tourism Pricing Tasks
Figure 3 for PricingLogic: Evaluating LLMs Reasoning on Complex Tourism Pricing Tasks
Figure 4 for PricingLogic: Evaluating LLMs Reasoning on Complex Tourism Pricing Tasks
Viaarxiv icon

Accept or Deny? Evaluating LLM Fairness and Performance in Loan Approval across Table-to-Text Serialization Approaches

Add code
Aug 29, 2025
Figure 1 for Accept or Deny? Evaluating LLM Fairness and Performance in Loan Approval across Table-to-Text Serialization Approaches
Figure 2 for Accept or Deny? Evaluating LLM Fairness and Performance in Loan Approval across Table-to-Text Serialization Approaches
Figure 3 for Accept or Deny? Evaluating LLM Fairness and Performance in Loan Approval across Table-to-Text Serialization Approaches
Figure 4 for Accept or Deny? Evaluating LLM Fairness and Performance in Loan Approval across Table-to-Text Serialization Approaches
Viaarxiv icon

Voice Conversion Improves Cross-Domain Robustness for Spoken Arabic Dialect Identification

Add code
May 30, 2025
Viaarxiv icon

Charting the Landscape of African NLP: Mapping Progress and Shaping the Road Ahead

Add code
May 28, 2025
Viaarxiv icon

Attention on Multiword Expressions: A Multilingual Study of BERT-based Models with Regard to Idiomaticity and Microsyntax

Add code
May 09, 2025
Viaarxiv icon

Colombian Waitresses y Jueces canadienses: Gender and Country Biases in Occupation Recommendations from LLMs

Add code
May 05, 2025
Viaarxiv icon

Evaluating Grounded Reasoning by Code-Assisted Large Language Models for Mathematics

Add code
Apr 24, 2025
Viaarxiv icon

Agree to Disagree? A Meta-Evaluation of LLM Misgendering

Add code
Apr 23, 2025
Viaarxiv icon